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developed to describe this pattern, which, when tested, explains 
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The Diffusion o-f Academic In-format i on : A Mathematical Model o-f Citations in 
the Sciences, Social Sciences and Arts and Humanities 



ABSTRACT 



This paper examines the pattern o-f diffusion in the academic 
literatures of the Sciences, Social Sciences and Arts and Humanities based 
on citations. An examination of the citations of articles from a given 
year to the year in which the cited article was published reveals a pattern. 
The percentage is initially small when there is no lag between the years. 
It then increases, reaching a peak in less than two years. Then, it 
gradually decreases over time. A mathemat ical model was developed to 
describe this pattern, which when tested explained between 96.9 and 98. 3X of 
the variance depending on the data set. The results are interpreted as an 
example of social learning and forgetting. 



The Diffusion of Academic Information: A Mathematical Model of Citations in 
the Sciences, Social Sciences and Arts and Humanities 

Theory 

One of the central areas of study within Communication is the diffusion 
of innovation and the dissemination of information. Rogers (1983:11) 
defines an innovation as, "an id*a, practice, or object that is perceived as 
new by an individual or other unit of adoption. ...The 'newness' of an 
innovation may be expressed in terms of knowledge, persuasion or a decision 
to adopt." In the academic environment, publ i shed manuscr i pts in the form 
of books or journal articles present new ideas. While they are generally 
considered channels for disseminating information about innovations, they 
are, in fact, innovations themselves; new objects capable of initiating 
cognitive change in adopters (Barnett, 1978). 

Academia is often characterized as an invisible college or a "highly 
interconnected network of scholars who share a theor e t i cal -met hodol og i cal 
framework" (Crane, 1972). Essential to the maintenance of this network is 
the sharing of ideas and knowledge. This takes place through academic 
literature (Price, 1963). The process of communication among academics may 
be described by the practice of scholars referencing the published research 
of one another. Citations represent communication links among academic 
researchers. In fact, networks of scientists have been studied using 
citations as their relationship (Price, 1945; Goffman, 19<4<4'j Small & 
Griffith, 1974; Garfield, 1979; Brittain, 1985). They also represent 
manifestations of the adoption process serving as indicators that an 
innovation, a new idea, was adopted by an individual scholar. Thus, the 
practice of referencing another's published research may be used to describe 
the process of the diffusion of innovations in the academic community. 
Clique integration and individual connectedness are related to the 
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adoption of an innovation (Rogers & Kincaid, 1981). When researchers are 
connected to one another, it -follows that the new ideas would be adopted 
more readily by those within his/her network than by those outside the 
network's boundaries. The adoption of the innovation would be expressed in 
terms o-f a scientist's knowledge as manifested in citations o-f academic 
articles. In a sense, these networks are based on shared adoption o-f 
innovations. 

The growth o-f knowledge may also be described by the citation process. 
Price (1965) reports exponential growth o-f the body o-f world literature -for 
the last -few centuries. This growth rate is expected to continue at its 
present rate at about 5X per year. There will be about seven new papers 
each year -for every 100 previously published papers in a given -field. 

Price (1945) describes the networks that emerge -from citation o-f 
scientific papers. He -found that in an average year, 35X o-f all existing 
papers are not cited and that 49X are cited only once. The percentages then 
decrease until only IX o-f papers are cited six or more times. Patterns 
exist within networks, such that multiple relationships emerge out o-f hal-f 
o-f the references from new papers. These new papers cite about half of the 
papers that were published previously. "Thus, each group of new papers is 
'knitted' to a small, select part of the existing scientific literature but 
connected rather weakly and randomly to a much greater part" (Price, 
1965:512) ■ 

This suggests that the change of paradigms, theoret i cal -me thodolog i cal 
frameworks, may be described by which articles are referenced (Kuhn, 1970). 
That is, certain seminal articles are published and cited. Their frequency 
of citation as an indicator of the importance of the manuscript and the 
degree of acceptance of the paradigm (Garfield, 1979). The citation 
practice takes on specific characteristics over time. For any given article 



or area of study, the -frequency of citation grows to a peak be-fore it drops 
off. Burton and Kebler (I960) and McRae (1969) report that the number of 
citations a manuscript receives decays exponentially with time. The drop 
off in citation takes place as the paradigm shi-fts. Goffman (1964) 
described this pattern -for investigation o-f the mast cell. Dieks and Chang 
(197<£) report a similar pattern in the literature o-f magnetic resonance. 
Thus, the pattern which describes the practice o-f citation may be used to 
develop models o-f the di-f-fusion o-f innovat ions, and the dissemination o-f 
knowl edge . 

The study o-f communication networks among academics and how they change 
over time is made possible by citation indexing (Institute -for Scientific 
Information, 1982, 1984a, b). Citation indexing is the practice o-f 
systematically recording and organizing the references that authors make to 
previously publ ished manuscripts. Almost all authors o-f academic articles 
and books make reference to the work of other authors. The references 
indicate that the cited authors support, elaborate, or illustrate some 
idea(s) presented by the author. Garfield (1979: 1) describes citations as 
"formal, explicit linkages between papers that have particular points in 
common. ■ Referencing another's work is considered . holarly practice 
within the academic environment. For this reason, bibliographies, and 
therefore, an index of ideas as presented in the Science Citation Index 
(SCI), the Social Science Citation Index (SSCI) and Arts and Humanities 
Ci tat ion Index (AHCI) are powerful means of search effectiveness. 
Economically, it is impossible to use the references from all journals. 
However, the Citation Indicies identify several thousand journals from all 
the academic disciplines which are considered to publish the highest quality 
material. From these journals, all of the references are indexed, 
regardless of who wrote it, when it was written or where it appeared. By 



this me thod, SCI , -for exampl e , is abl e to annual 1 y index 7 mi 1 1 i on 
references -from 3,000 to 4,000 journals and books -from all scientific 
disciplines. Because of the extensive overlap of the areas within the 
disciplines, this method of indexing "accounts for an overwhelming major i ty 
of the material important enough to be referenced or abstracted" (Garfield, 
1979:21). 

Garfield stresses the advantages of citation indexing in terms of cost 
and simplicity over subject and title word indexing. The indices are 
compiled such that Key papers, all presenting ideas in the same area of 
study, would be located together. In a subject index, if the name of this 
same idea had changed over time, the search for key papers could be quite 
difficult. An article in the SCI is not limited to just one subject area, 
however, "each reference citation is associated with as many subject 
meanings as other scientists attribute to it" (Garfield, 1979: 10). 

Garfield (1979) describes the use of the Science Citation Index (SCI) 
for defining historical accounts of the development of innovative ideas in 
the scientific field. By networking citation behaviors of researchers in a 
specific area of knowledge, relationships and events can be identified, and 
the route of the innovation mapped. Garfield (1979) demonstrated the use of 
the SCI in identifying the links between academic researchers by the 
networking of citations. Scientists involved in co-citation at a perscribed 
frequency and strength were considered to have a mutual relationship. 
Clusters of authors were identified for specific areas of research, each 
scientist having mutual influence with at least one member within the 
cluster . 

This paper develops a mathematical model which describes how ideas 
manifested in academi . articles are diffused within academic disciplines, 
specifically, the Sciences, Social Sciences, and Arts and Humanities. It 
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•focuses on citations as the innovations. 
Introduction to Model 

The diffusion process is typically described by an S-shaped curve in 
which the cumulative numbers of adopters is plotted with respect to the time 
of an innovation's adoption (Rogers, 1983). The distribution of adopters 
initially rises slowly. The curve has a small positive slope. It then 
accelerates to * nential growth to a maximum until haH o-f the population 
adopts the innovation. It then increases at a decreasing rate. Although 
still positive, the slope approaches zero. The curve becomes a decaying 
exponential, becoming asymptotic with the number o-f members o-f the adopting 
population. The -frequency o-f adoption at any single point may be described 
over time by a bell-shaped normal curve. Mathematical descriptions o-f these 
curves are presented by Barnett (1978). 

There are a number o-f problems with di-f-fusion research and thus, the 
mathematical description o-f the process. Rogers (1983) suggests a number o-f 
criticisms including the pro-innovation bias o-f di-f-fusion research, the 
•focus on the individual rather than the system in which the process takes 
place and the lack o-f longitudinal investigation. These problems are 
addressed in this paper. 

The pro-innovation bias is the implication o-f most di-f-fusion research 
that any innovation should be di-f-fused and adopted by all members o-f a 
society. Further, the innovation should be di-f-fused more rapidly and that 
the innovation should be neither re-invented nor rejected (Rogers, 1983:92), 
One manifestation o-f this bias is the -focus o-f di-f-fusion research on 
adoption rather than disadoption. Rogers (1983: 21) labels this 
discont inuence, M a decision to reject an innovation after it had previously 
been adopted". There has been relatively little research designed to 
investigate the nature of discontinuance, and as a result relatively little 
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is Known about this aspect of diffusion behavior. Rogers (1983) identifies 
two types o-f discontinuance, replacement and disenchantment, A replacement 
discontinuance is a decision to cease using an idea in order to adopt a 
better idea which supersedes it. Certain academic fields change rapidly. 
Particularly in the sciences there are constant waves of innovations. Each 
new idea replaces an existing practice which in its day was also an 
i nnovat i on . 

A disenchantment discontinuance is a decision to cease using an idea as 
a result of dissatisfaction with its performance. The dissatisfaction may 
come about because the innovation is inappropriate for the individual and 
does not result in a perceived advantage over an alternative practice. 

Both types of discontinuance may occur in the pursuit of academic 
Knowledge. A change in research paradigm may bring about the discontinuance 
of the use of a literature because the ideas have been rejected in favor of 
better research justification or because the ideas in those articles were 
simply inappropriate (Kuhn, 1970). 

Diffusion research has traditionally focused on the individual adopter 
rather than the system in which the process takes place. Individuals are 
generally treated as passive receivers of information about an innovation. 
This has led to an overreliance of the examination of role of psychological • 
variables in the adoption process. Seldom are the investigations about the 
sources or channels by which the information about the innovations is 
di ssmemi nated. Journal articles are one of the sources academics use to 
gather information about new ideas. Thus, an examination of the pattern of 
citations of academic journals may provide new insights into the diffusion 
process. 

Further, the use of individuals as the unit of analysis has led to 
imprecise descriptions of the diffusion process. An alternative would be to 
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psychological states of individual adopters. Rather, it -focuses on 
disadoption as well as adoption, on an information source or channel used by 
social networks in the aggregate. Further, it is longitudinal, examining 
the diffusion process over time based on data from archival records. 
Mathematical Model of Disadoption 
Rogers (1983) suggests that researchers can investigate how a practice 
is discontinued. Almost as an after-though he presents a graphic 
representation of the "discontinuance curve 11 . It describes a decaying 
exponential. Coleman (1969) and Hamlin, et al . (1973) provide precise 
mathematical descriptions of the curve. A decaying exponential has been 
empirically observed for the use of information over time by Goffman, 
(1966), Dieks and Chang (1976) and Levy and Fink (1984). 
The Model 

The distribution of citations may be modelled as, 

y(t) = a (exp C-b(t+d)] - exp C-c(t+d)]) 
where y(t) is the proportion of citations made in a given year to papers 
published t years previous, and a, b, c, anc d are non-negative constants, 
with c>b. When t + d equals zero, y(t)= 0, and as t — > infinity, y<t)~ > 0. 
Thus, citations are presumed to start at time equal to -d. This is so since 
articles are often cited prior to their actual publication date; the d 
parameter corrects for this situation as much as possible, given the data 
set. The parameter d is the time, in years, prior to publication at which 
citation is initiated. Given the data set (described below), it is unlikely 
that d can be greater than +1. 

The model assumes that the citation process involves an initial 
increase in the aggregate probability of citation, followed by a decline 
which eventually reaches zero. Even seminal articles decline in citation 
frequency after a peak, since new ideas ultimately become common knowledge 
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of a field or discipline, and after a period no longer has the need for the 
active researcher to cite its originator. Of course, fields differ in this 
characteristic, and work which is fundamentally of an historical nature will 
necessarily have citations to primary sources which may be quite old. 
Furthermore, fields that can present new knowledge as "di scover i es, " which 
makes old knowledge out of date, are less likely to have citations that go 
back very far in time. Thus, the discipline type may differ in the extent 
that they cite works at different lags. It is hypothesized that the 
Sciences are least likely to cite work in the distant past, the Social 
Science next, an the Arts and Human i t i es more likely (Garfield, 1979; Koshy, 
1976). 

One way to evaluate the trajectory of citations over time is to find 
that point in time on the curve derived from the model at which the 
proportion of citations is at a maximum. This point, which we call t H , 
i s 

t* =<(1og c -log b)/ (c-b)) -d, 
with the maximum proportion of citations <ynAX> being 

yH ax = a{<c-b)/b)<b/c)<c/<c-tn 

This model has been used to model drug concentration in the blood, 
which is also a process that starts at zero, rises, and then returns to zeri 
as time increases (Burghes & Wood, 1980). Diffusion curves, as already 
pointed out, do not allow for disadoption over time, which is a fundamental 
aspect of the problem here. Epidemic models, which have also been used to 
model the process here (Goffman, 1966) assume that once one is in the pool 
of 'infected* (i.e. once one has cited a paper at a particular ' g) , one is 
no longer eligible to be "reinfected" (to cite a paper at a given lag 
again). Finally, a model derived from a Poisson distlbution (Dlek* & Chang, 
1976) assumes that citations are independent, which is almost certainly 
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unrealistic; to compensate 'for this, the authors did not count citations, 
but the number of different citing authors to the same paper by a given set 
of authors. Even doing this fails to eliminate the dependency problem. 

In sum, the model proposed allows the citation proportion to go from 
zero to a positive value and back to zero. It provides for the prediction 
of the point in time at which citation is set at a maximum <t H ), and 
also for that citation proportion ( y m ax ) • Utilizing this model, 
comparisons between fields is possible and useful, since the extent that the 
model's parameters differ across fields may be examined. The reader should 
take note that the model presented here uses articles written in a given 
year and examines the pattern of citation made by these articles to articles 
previously published. An alternative approach to the proposed model would 
be to take articles published in a given year and look at citations to them 
in subsequent years. 
Methods 

Data Source 

The data used to test tne model comes from the Guide and List of Source 
PubMcations (Institute for Scientific Information, 1982, 1984a, b) of the 
Science, Social Science and Arts and Humanities Citation Indices. These are 
extensive data bases which provide accurate and stable measures of the over 
time citation process. In 1984, 3,281 scientific journals and monographs 
were examined. These contained 361,989 articles and book reviews which 
cited 8,911,6?<4 articles and mongraphs. In the same year, 4,653 Social 
Science journals were examined. They contained 57,301 artices and 36,416 
book reviews which cited 1 ,460 ,363 manuscr i p ts. In 1982, the Arts and 
Humanities Citation Index, examined 5,418 monographs and journals which 
contained 30,893 articles. There we.^e 756,460 unique citations. 

The specific data used to test the model were the percentage of unique 
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citations -for a given publication date. For example, what percentage o-f the 
citations -from 1984 articles were originally published in 1983, 1982, and so 
on? For the Sciences, the annual percentage o-f citations were presented 
between 1950 and the year the citations were indexed -for the years 1941 to 
1984. Thus, there were 564 data points. Between 72.82X <19<41> and 95. 99* 
(1984) o-f the scientific citations were printed during this time span. For 
the Social Sciences the data also went back to 1950, but only -for 1969 to 
1984. There were 440 data points. Between 80.49X (1969) and 90.47X < 1 984) 
of Social Science citations were published during this period. For the Arts 
and Humanities the annual data went back to 1800 -for the years 1 976 to 1982, 
resulting in 1260 data points. Between 96A6V. (1977) and 98.14'/. <1982) oi 
dated citations in the Humanities i-?re published a-fter 1800. Thus, these 
data provide the length o-f time -for the di-f-fusion and discontinuance of 
academic information. The actual data is presented in an appendix. 
Analysis Procedures 
SPSS NONLINEAR was used to evaluate the model. It requires the user to 
estimate the model parameters be-fore it can provide the best -fit and 
estimates -for the parameters. The initial est imat i ons were as -follows; 
a = 1 .0 

b = -1.0, with an upper limit o-f 0.0, to corn-form with the 
theoretical assumptions o-f the model. 

c = -.1,0, with an upper limit o-f 0.0, to corn-form with the 
theoretical assumptions o-f the model. 

d = +0.5 The unit o-f time in this data set was the year. A 
percentage o-f the citations occurred with a time lag o-f zero. This suggests 
that the value required to set t equal to zero (to account -for the lag) was 
0.0 < d < 1,0. Thus, the middle value, +.5, was chosen -for the initial 
est imat e . 
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There are di f f erences among the curves -from the three disciplines 
<Garfield, ,1979). Due to the nature of the -fields, the diffusion process in 
the Sciences occurs faster than the Social Sciences, which, in turn, occurs 
faster than the Arts and Humanities. Scientists cite a greater percentage 
of more recent articles. Thus, their curve reaches a higher peak in a 
shorter period of time as they adopt new ideas. Disadoption starts earlier 
as the articles in use are replaced by new up-to-date ones. In the Arts and 
Humanities, it takes a long period of time for ideas to be adopted and 
longer for the disadoption of an idea. Their curve rises more slowly. A 
smaller percentage of citations comes from any given year. Thus, it has a 
lower peak. Also, it falls more slowly over a longer period of time. The 
diffusion process for the Social Sciences is moderate between the other 
disciplines. It rises more slowly than Science, but faster than for the 
Arts and Humanities, resulting in a peak between the two. It then falls at 
a rate between them, faster than the Arts and Humanties but more slowly than 
the Sciences. The differences in the curves should be reflected in the 
model's parameter values. A graphic representation of the relation among 
the curves is presented in figure 1. 

To evaluate the goodness of fit of the proposed model, several tests 
will be employed. The R-squared from the nonlinear regression and the 
plausibility of the derived parmeters, particularly d, t*, and yfiax, 
will be examined. Further, the residuals from the nonlinear regression for 
the model should be homoscedast i c , normal, and will not exhibit any 
systematic patterns. For a full discussion of these issues see Bauer and 
Fink (1983). To the extent that the residuals fail to meet these 
assumptions, transformation of the dependent variable will be attempted. To 
the extent that the data fail to confirm to these assumptions, regardless of 
transformation, the model wil be considered incomplete: i.e., some important 

12 



•factor that "explains" the systematic character of the residuals has been 
left out. 

Since the data consist of proportions, candidate trans-formations 
include the arc sine, log-odds and the square root trans-formation, as well 
as other trans-formations which cause the data to behave similarly. Again, 
•failure to meet the assumptions indicate above point out the incompleteness 
o-f the model . 

FIGURE 1 ABOUT HERE 

Resul ts 

The results o-f the test o-f the model -for the three data sets are 
presented in table 1. Scatterplots -for the three data sets <per cent 
citation on time) are presented in -figures 2a, b and c, The model -fit the 
da*^ very well, explaining 96.9% o-f the variance -for Science citations, 
98, -for Social Science and 97. IX -for Arts and Humanities, All 
coefficients are within the specified theoretical limits, Coe-f -f i c i en ts b 
and c are less than 0.0 and the values o-f c (-1.95, -1,35, -2.76) are 
greater than b <-.138, -.139, -.074). The values of coefficient d are all 
between 0.0 and 1.0 <.195, .139, .160). It was predicted that the 
coefficients would be monoton i cal 1 y ordered Science, Social Science and Arts 
and Humanities. This is not the case. However, they do have this relation 
if each coefficient may take on any value within its confidence limits. 
TABLE 1 AND FIGURES 2A, B AND C ABOUT HERE 

The values for t*, the point on the curve at which the proportion 
of citations are at a maximum, were 1.53 (Science), 1.74 (Social Science) 
and 1.16 Arts and Humanities). Again, the hypothesized order was not 
observed. Diffusion for the citations of the Arts and Humanities occurred 
at a faster rate than for the other two disciplines. 

The values for ynAx , the maximum proportion of citations, were 10.0 
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(Science), 9,40 (Social Science) and 4,20 (Arts and Humanities), These are 
as hypothesized. 

An analysis of the residuals raises some concern over the model. In 
all three cases, the residuals are not normally distributed. While they are 
only slightly skewed, 1,602 -for Science, -.279 -for Social Science and .16C 
for the Arts and Humanities, they are very peaked. The kurtosis was 17.69 
■for Science, 10.00 for Social Science and 15.53 for Arts and Humanities. 
Further, the residuals correlated s i gn i f i can t 1 y w i th the dependent variable 
for Science <r =-.103, p(,Q0<4) and the Arts and Humanities <r = -.091, 
p<.000). The correlation was -.023 <p<.318) for Sciences. A graphic 
representation of the residuals for Sciences is presented as figure 3. It 
reveals a pattern. The residuals are large when t is small. They become 
smaller when t becomes large. A similar pattern was for the Sciences and 
Arts and Humanities. This suggests that there may be some other parameter 
effecting the relations described by the model. 

FIGURE 3 ABOUT HERE 

An examination of the scatterplots of the per cent citation by time 
(presented in figures 2a, b & c) reveals that the pattern of the residuals is 
due to heteroscedast i c i ty. The variance in the percentage of citation is 
greater for those cases with a short lag than those with a long lag. This 
results in a skewed and peaked distribution of residuals and a significant 
correlation with the per cent citation. The points with a shorter lag had 
greater residuals. One solution to this problem would be to transform the 
dependent variable to remove the heteroscedast ic i ty from the data. 
Di scussi on 

This paper developed a mathemat i cal model to describe the diffusion 
process. The proposed model has an advantages over ones currently in use 
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□ecause 11 aescrioes me process ot disadoption In which the use of an 
innovation is discontinued. As a result, the model does not suffer -from a 
pro-innovation bias. The model was tested using citation data -from the 
Sciences, Social Sciences and Arts and Humanities. These data are based on 
aggregates rather than focusing on individuals and aro longitudinal rather 
than measuring diffusion at a single point in time. It -fit the three sets 
of citation data excellently. It accounted -for 96.9V. of the variation -for 
Science, 9B.3V. for Social Science and 97. VA -for Arts and Humanities. All 
coefficients were within theoretically specified limits. The values of 
t* suggest a more rapid diffusion for the Arts and Humanities than for 
the Social Sciences or the Sciences. ynAx indicates that the diffusion 
process is more peaked for the Sciences than Social Sciences than the Arts 
and Human i ti es. 

The model is not without its problems. Foremost is the issue of 
heteroscedast ici ty. The resi dual s were not distributed normally. For all 
three data sets, they are skewed and peaked. Further, for two data sets 
they are significantly correlated with the dependent variable, the 
proportion of citations for a given year. Several attempts wtre made to 
transform the raw data to remove the heteroscedast i c i ty . Log, square root, 
arc sine and log-odds transformations were applied without any improvement 
in the goodness of fit. Adjustments to the basic model were made to account 
for the pattern of the residuals without success. 

The pattern of the residuals and the problems of heteroscedast i c i ty 
suggests that future research must be conducted to more accurately describe 
the pattern of academic citation. This may take the path of transforming 
the raw data or changing the basic model. This research is ongoing. 

This model has implications for social learning theory (Bandura, 1977). 
Academic research may be considered an example of social learning. Students 
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are taught research procedures through exemplars, classic experiments which 
serge as models -for them to observe (Kuhn, 1970). Scholars learn -from 
another by reading each others research publications. Research manuscripts 
in print serve as models -for -future work. The individual scholar extracts 
the essential elements of the research paradigm -from published manuscripts 
and perform similar research. The learner adopts the ideas in the articles 
to meet their individual research needs. 

Social learning theor> makes an important contribution to diffusion 
theory because it suggests that individiuals may modify innovations and use 
them as they -feel appropriate rather than as prescribed by their designer. 
Hamblin, et al . (1973) even provide a mathematical model -for social 
learning. However, what has not been addressed is social forgetting. In 
the proposed model, social learning is represented as (expEbt+dl) and the 
social -forgetting process by the expression (-expEc t + d] ) . Overtime, as ideas 
are no longer invoked they are forgotten by the members of society. They 
are not communicated and newly socialized members cannot observe elders 
working with those ideas. Likewise, academic research may be viewed as 
being forgotten. Paradigms may be disadopted. Cited articles may be no 
longer referenced. Exemplar experiments, taught in laboratories, may be 
dropped in favor of newer models which take advantage of technological 
advances and paradigmatic shifts. This paper presented a model of social 
learning and forgetting. For academia, these results suggest that new ideas 
are learned faster than others are forgotten. This implies the fortunate 
consequence that some ideas are retained allowing academia to acculumlate 
knowl edge . 
Summary 

This paper examined the pattern of diffusion in the academic 
literatures of the Sciences, Social Sciences and Arts and Humanities based 

16 

19 



on citations. An examination of the citations of articles from a given 
year to the year in which the cited article was published revealed a 
pattern. The percentage was initially small when there was no lag between 
the years. It then increases, reaching a peak in less than two years. 
Then, it gradually decreased over time. A mathematical model was developed 
to describe this pattern, which when tested explained between 9<4.9 and 98. 3V. 
of the variance depending on the data set. The proposed model had problems 
due to the he teroscedast i c i ty in the citation variable. The results were 
interpreted as an example of social learning and forgetting. 
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